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In this paper we present the derivation details, logic, and motivation for the loop calculus intro- 
duced in [l|. Generating functions for three inter-related discrete statistical models are expressed 
in terms of a finite series. The first term in the series corresponds to the Bethe-Peierls/Belief- 
Propagation (BP) contribution, the other terms are labeled by loops on the factor graph. All loop 
contributions are simple rational functions of spin correlation functions calculated within the BP 
approach. We discuss two alternative derivations of the loop series. One approach implements a set 
of local auxiliary integrations over continuous fields with the BP contribution corresponding to an 
integrand saddle-point value. The integrals are replaced by sums in the complimentary approach, 
briefly explained in [lj. Local gauge symmetry transformations that clarify an important invariant 
feature of the BP solution, are revealed in both approaches. The individual terms change under 
the gauge transformation while the partition function remains invariant. The requirement for all 
individual terms to be non-zero only for closed loops in the factor graph (as opposed to paths with 
loose ends) is equivalent to fixing the first term in the series to be exactly equal to the BP contri- 
bution. Further applications of the loop calculus to problems in statistical physics, computer and 
information sciences are discussed. 

PACS numbers: 05.50.+q,89.70.+C 



One practically useful yet generally heuristic approach used for calculations of observables (correlation functions) 
in discrete statistical physics models, e.g. Ising model, is related to the so-called Bethe-Peierls (BP) approximation 
[1, H, [3 . The BP approach is exact for graphs that do not contain loops, usually referred to as trees; otherwise the 
approach is approximate. The ad-hoc approach can also be re-stated in a variational form @,d,0|- A similar tree-based 
method in information science has been developed by Gallager 8, 9] in the context of error-correction theory. Gallager 
introduced the so-called Low-Density-Parity-Check (LDPC) codes, defined on locally tree-like Tanner graphs. The 
problem of ideal decoding, i.e. restoring the most probable pre-image out of the exponentially large pool of candidates, 
is identical to solving a statistical model on the graph [id ]. An approximate yet efficient Belief-Propagation decoding 
algorithm introduced by Gallager constitutes an iterative solution of the Bethe-Peierls equations derived as if the 
statistical problem was defined on a tree that locally represents the Tanner graph. We utilize this abbreviation 
coincidence to call Bethe-Peierls and Belief-Propagation equations by the same acronym - BP. Recent resurgence of 
interest to LDPC codes [HI, [13], as wei ^ as proliferation of the BP approach to other areas of information and computer 
science, e.g. artificial intelligence [l3| and combinatorial optimization [lj|, [l5|, Gil, where interesting statistical models 
on graphs with long loops appear, made the BP approach to be one of the most interesting and hot research topics 
in modern information and computer sciences. 

In spite of the lack of analytical control in the general case of graphs/models with loops, the BP approximation 
and the corresponding algorithm provide remarkably accurate results. Based on this empirical observation one would 
expect an existence of a hidden mathematical structure that can rationalize an inessential, sub-leading role of the 
corrections associated with the loops. Besides, an in depth understanding of the BP success would also provide a 
practical guidance for improving BP even further by accounting for non-local loop-related correlations. However, 
with the exception of two recent papers [13, EU , the discussion of this important point was largely superficial and 
anecdotal. The Ising model (pair-wise interactions between the bits) on a graph with loops has been considered 
by Montanari and Rizzo [l7j], where a set of exact equations has been derived that relates the correlation functions 
to each other. This system of equations is under-defined; however, if irreducible correlations are neglected the BP 
result is restored. This feature has been used (l7| to generate a perturbative expansion for corrections to BP in 
terms of irreducible correlations. A complementary approach for the Ising model on a lattice has been taken by 
Parisi and Slanina [l8j |. who utilized an integral representation developed by Efetov [l9| in early nineties. The saddle- 
point for the integral representation used in 18] turns out to be exactly the BP solution. Calculating perturbative 
corrections to magnetization, the authors of [18j encountered divergences in their representation for the partition 
function, however, the divergences canceled out from the leading order correction to the magnetization revealing a 
sensible loop correction to BP. These papers, [TtJ and [Hj], became important initial steps towards calculating and 
understanding loop corrections to BP. However, both approaches are very far from being complete and problem-free. 
Thus, [17| lacks an invariant representation in terms of the partition function. Instead it requires operating with 
correlation functions. Besides, the complexity of the equations related to the higher-order corrections rapidly grows 
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with the order. The complimentary approach of |18j contains dangerous, since lacking analytical control, divergences 
(zero modes), which constitutes a very problematic symptom for any field theory. Both [17| and [181 ] focus on the Ising 
pair-wise interaction model. The extensions of the proposed methods to the most interesting from the information 
theory viewpoint multi-bit interaction cases do not look straightforward. Finally, the approaches of 17 1 and (l8| . if 
extended to higher-order corrections, will result in infinite series. Re-summing the corrections in all orders, so that 
the result is presented in terms of a finite series, does not look feasible within the proposed techniques. 

In [l| we suggested an ultimate way to account for loop corrections to BP. We represented the partition function 
for a general discrete statistical model defined on a finite factor graph in terms of a series decomposition. The most 
remarkable feature of the suggested exact decomposition, that does not appear within the previous approaches [r7Hl8l|. 
is the representation of the partition function as a finite (!!) series with the first term being exactly represented by 
the BP solution. All higher-order terms are labeled by generalized loops in the factor graph. A generalized loop is 
defined as a possibly branching undirected path in the factor graph that has no loose ends. Each term in the series is 
represented as a product of local contributions along the loop, each contribution being expressed explicitly in terms 
of some correlation functions calculated within the BP approximation. 

The present manuscript generalizes and details the approach of [l[. In addition to explaining all technical details 
of the loop series derivation of [l[ we also provide an alternative approach based on an integral representation for the 
partition function. For the integral representation BP appears as a result of applying the saddle-point approximation. 
We pay special attention to clarifying the relation between the saddle-point approximation for the integral and the 
Bethe Free energy approach of Q, as well as between the analysis of the Gaussian corrections and the saddle-point. 
We also provide a technical rationale for a formal gauge transformation in the integral representation for the model 
partition function (transformation of variables and decomposition of the integrand in a series) that results in the loop 
series expression. 

The integral representation approach is formulated for a bipartite factor graph model fli, [20L |22| which is a particular 
case of the general vertex model of [2l| also considered in the manuscript. For the presentation clarity we introduce a 
bipartite vertex model, orientable vertex model, that is less general than the general vertex model, yet constitutes a 
generalization of the bipartite factor graph model [23| . The vertex models are more general compared to the bipartite 
factor graph model and allow a simpler derivation of the loop series using an auxiliary discrete transformation (discrete 
Fourier transforms) in place of its integral counterpart. We actually start the technical part of the paper by describing 
a simpler and more compact discrete variable representation before turning to a lengthy, still ideologically important, 
integral (continuous variables) counterpart. 

The auxiliary degrees of freedoms, one per the graph edge, introduced within both integral/continious and 
sum/discreet approaches, possess a gauge symmetry that allows an invariant definition of the BP equation. Gauge 
transformation corresponding to the symmetry keeps the full expression for the partition function invariant while 
changing the individual term of the series. An individual term corresponds to a path on the graph that may generally 
contain some number of loose ends. The BP equations can be viewed as conditions for fixing a special gauge that 
requires all allowed paths (i.e. those who contribute to the series) to be nothing but generalized loops that do not 
contain any loose ends. For this special gauge the BP approximation is described by the first bare term in the loop 
series. 

The formulation of BP as a gauge fixing condition also allows a clear physical interpretation of the entire approach. 
Indeed, the first bare term of the loop series can be viewed as a "ground state" that minimizes the Bethe free energy 
with loop corrections being related to certain excited states described as along-the-loops spin flips with respect to 
the ground state. Such interpretation of the loop series makes our approach similar to the so-called high-temperature 
expansion, where individual contributions (diagrams) also correspond to close loops on the factor graph. There is, 
however, a very important key difference between the loop series and the high-temperature expansion. While the 
high-temperature expansion starts with a trivial bare term (just unity in the expansion of the partition function) the 
bare term in the loop series is highly nontrivial. It is represented by the BP approximation that already accounts for 
some local correlations in the model. 

The manuscript is organized as follows. In Section [J we introduce our three basic models: bipartite factor-graph 
model, orientable (bipartite) vertex, and general vertex models. Vertex models are convenient generalizations of the 
bipartite factor-graph model. In Section U we also state our major result - exact expressions for the models' partition 
functions in terms of finite series, coined loop-series, over closed paths defined in the models' graphs. The rest of 
the paper is devoted to derivations and discussions of these results. Straightforward and simple derivation of the 
loop series for the vertex model is described in Section |TTJ BP-equations emerge as a result of a requirement for 
the finite series representation for the partition function of the model to have the loop-series form (with no terms, 
correspondent to a path with loose ends, present). In Section [Ml we derive the loop-series for the factor-graph model 
via an integral representation. This derivation is more involved, however we present it here in full as it allows to 
establish a relation between the loop-calculus and other approaches in theoretical physics, e.g. saddle point analysis. 
Section IIIII contains a number of Subsections. Integral representation for the partition function of the factor-node 
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model is introduced in Subsection IIII Al Subsection IIII Bl describes the relation of the integral representation to the 
Bethe free energy variational approach of [7| . The latter is also briefly sketched in Appendix [X] In Subsection IIII CI 
we present an approximate saddle-point analysis of the integral representation for the partition function of the factor- 
graph model. Here we show that the saddle-point is described by the BP equations. The Gaussian approximation 
around the BP saddle-point is discussed in Subsection IIII Dl Finally, the derivation of the exact loop-series via the 
integral representation is described in Subsection IIII El Section IIVI is devoted to conclusions where we also discuss 
possible generalizations as well as practical utility of the loop-series/caclulus in information/computer science and 
statistical physics. Appendix [B] illustrates the loop calculus on a simple example of two bits, two checks bipartite 
factor graph model with single loop. 



I. LOOP SERIES FOR THE FACTOR-GRAPH AND VERTEX MODELS 



A. Bipartite factor-graph model 

Consider a generic discrete statistical model, with configurations characterized by a set of binary variables: aj = ±1, 
i = 1, • • • , n, which is factorized so that the probability p{(Ji\ to find the system in the state and the partition 
function Z are 



(1) 



where a labels non- negative and finite factor-functions f a with a = l,...,m and a a represents a subset of oi 
variables. Relations between factor functions (checks) and elementary discrete variables (bits), expressed as i G a 
and a 9 i, can be conveniently represented in terms of the system-specific factor graph. If i £ a we say that the 
bit and the check are neighbors. An example of a factor graph with m = 4 that corresponds to p{u\, <J%, 03, 04) = 
Z~ 1 fa{<ra)fb(er b )f c (cr c )f d (er d ), where a a = (ai,a 2 ), o h = (<7i, cr 2 , 03), <J C = (<7i , 03 , 04) , a d = (03,04) and a = a,b,c,d, 
is shown in Fig.[TJ Any spin correlation function can be calculated using the partition function, Z, defined by Eq. (fT]). 
For example, the bit i magnetization is expressed as 



(*i> 



dlnZ 



dhi 



(2) 



where the following transformation of a factor node function associated with a check a neighboring the bit % is assumed: 
f a (cr a ) -> f a {<J a ) exp^CTj). 




FIG. 1: Example of a factor graph. Fourteen possible marked paths (generalized loops) for the example are shown in bold on 
the bottom. 
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B. Vertex models 

In this subsection we discuss vertex models of two types, orientable/bipartite and general. Similar to the factor 
graph model, the vertex models are formulated in terms of Ising spin variables, a = ±. However, while in the factor 
graph model spins reside in the bit nodes, spins in the vertex models are assigned to the edges. The orientable 
vertex model generalizes the factor-graph model described in the previous Subsection, while the general vertex model 
generalizes the orientable vertex model. 

1. Orientable Vertex model 

A graph is orientable if the whole family of its nodes, X, can be partitioned in two sub-families, such that nodes 
of one sub-family neighbors only nodes from the opposite sub-family. Also if a connected graph is orientable, there 
are exactly two different global orientations: A global orientation is chosen by picking some node on a graph and 
identifying it as left (or right). Choosing an orientation on an orientable graph partitions the set of nodes X = XlUXr 
into the subsets of left and right nodes, referred to as bit nodes and check nodes, respectively. Ising variables in the 
vertex model reside in the graph edges, i.e. the configurations are defined by sets of Ising variables a c = ±1 for 
c £ X\. For a graph with a chosen orientation it is also convenient to represent these variables as <7j a , with a £ Xr 
and j £ Xr, representing the check (right) and bit (left) end of an edge. The weight (probability) of a configuration 
is given by a product of weights related to the nodes: 

p ov (a)=z- v i n /j.fo) n /„(*•„), z ov =y. n n ^ 

jex L aex R {<r}jex L aex R 

A particular example of the oriented vertex model defined for the graph shown in Fig. [1] corresponds 

to p ~ /l(oi a , an,, Plc)h{&2ai C2b)/3(CT3b, Cr 3c , 3d) fi(a 4 C , CT4d)/a (c a l , CT a2 ) fb (<Jbl , <7b2 , Cb3)/c(c c l , & c3 , <^ci)fd{^d3 , & di) , 

where we do not differentiate between the bits and checks and the index order for a spin defined on the graph edge 
is not important. 

Obviously, the oriented vertex model ([3]) turns into the factor-graph model ((T|) if the functions fj adopt the following 
form 

f (rT \ _ / 1, Via. = Oijj Wa,P 3 i 

JjK<Tj) ~ | o, otherwise. [ > 

2. General Vertex model 

A general vertex model is determined by the weight function that can be represented in the following form 

Pg V (cr) = Z-J Y[ /o(oVx), Zgv = Y^ II /"( cr «)' ( 5 ) 

aeX v a£X 

where a denotes a vertex in the model; elementary spin is defined at the edge connecting two neighboring vertexes, 
a a b for b £ a and a £ 6; <r a stands for the vector built from all a a b where b £ a; er is a particular configuration of 
spins on all the edges. It is important to realize that with this notation we need to assume that a a b = <Jba- 

A general vertex model turns into the orientable vertex model if the whole family of bits {a} is divided in two 
subfamilies that correspond to checks and bits, a = i © a, and additionally for any bit/check the neighbors belong to 
the opposite families. 

Therefore, for the example shown in Fig. [1] the oriented vertex model and the the general vertex model simply 
coincide as the graph allows partitioning in two parts. A simple example of a general vertex model which does not 
correspond to any oriented case (the whole family of nodes is not divisible in two groups) is given by an interconnected 
triad of vertexes with pair interaction: p ~ fi(a\2, cri3)/2(<X2i, 023)/3(c3ii 032). 

C. Loop series 

In this Subsection we state the main result of the paper for the three models introduced above. 
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1. General Vertex model 



We start with the general vertex model. The partition function of the general vertex model, described by Eq. ([5]) 
is exactly equal to 



Z gv = Z (l + J^r^fC)^ 



11 Ma 



ae c 

[ ( 
(ab)eC 



rUC)= n d-m^y (6) 



where summation goes over all allowed C (marked) paths in the graph associated with the model; (ab) marks the 
edge on the graph connecting nodes a and b. The marked path is allowed to branch at any node/ vertex, however it 
cannot terminate at a node. We refer to such a structure as a loop (it is actually some kind of a generalized loop since 
branching is allowed; we use the shorter name for convenience). m a b is the magnetization at the edge that connects 
nodes a and b. /z a is the irreducible correlation function at node a. The order of the correlation function is equal to 
the number of marked nodes (nodes belonging to the marked path C) neighboring a. The bare partition function 
Zq, the magnetization m a b, and the correlation functions /i a are calculated within the BP procedure, described by 
Eqs. (I33I39I40I41I42I) . 



2. Orientable Vertex model 



The general formula ((6|) applied to the case of the nodes/vertexes partitioned into bits and checks, reads 



= Z [l + Y, r ov(C)J 



n m* n m/3 

_ sec J \p e c J 

ov{C) - n (i-o ' ( ) 

(ta)eC 



where summation goes over all allowed (marked) paths C in the graph associated with the model. A marked path 
(generalized loop) is allowed to branch at any bit/check, however, it may not terminate at a bit or check. In this case 
there are two types of irreducible correlation functions associated with bits and checks, respectively, and one type of 
magnetization ( which is associated with the edge that connects any bit with its neighboring check that necessarily 
belongs to the loop) entering Eq. ([7]), all calcuated within BP and defined in Eqs. (|29I30I31[) . Eq. ([7]) also follows 
directly from formulas of Section III Al 



3. Factor Graph model 



The decomposition of the partition function defined by Eq. (Q} into a finite series has a form: 

(8) 



Zfg = Z I 1 +^r /g (C) J , r fg (C) = *[[ MaMi 

V C I i,aEC 



(l-mi)"*- 1 + (-l)9'(l + m l )?^ 1 
2(1 - mf)-?.- 1 



/'< = ... , '^ 2 ; g ,ii — - — - » = E 1 ' { ' )] 



aGC 



i a = y]b a (a a ) _Q(o"i - mi), m l = ^ b l {<J i )a l . (10) 



where summation goes over all allowed (marked) paths C (generalized loops) . They consist of sets of bits and checks 
so that each of them has at least two distinct neighbors on the path. For the aforementioned example there are 
fourteen allowed marked paths (loops) shown in Fig. |T]) on the right. In Eqs. ijHJ) &i(o"j), b a (a a ) and Z are beliefs 
(probabilities) defined on bits and checks and partition function, respectively, calculated for the BP solution. The BP 
solution for the model is described in detail in Section HTT1 see also Appendix [Al 

It is easy to verify that if Eq. ([4]) is assumed for the generalized vertex model, Eq.([7]) turns exactly into Eq. ([8]). 
Indeed, under condition of J4} the irreducible correlation functions at a check in the two formulae are exactly equivalent. 
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One derives 



W - / ^(^(^-mO 9 * =^^[(1- m^- 1 + (-!)«(! + rnO 94 " 1 ], (11) 



J] (l- m fj^(l- TO |)« % (12) 

a£i,C 

where the definition of the da integration (summation) is given in Section Til Al Pi{(Ji) = (l + crjra,)/2 is the probability 
to find bit i in the state <jj within the BP solution, and qi is the connectivity degree of the bit i at the marked subgraph 
C, defined by Eq. ©. All together the equivalence is completely restored. 

II. LOOP SERIES DERIVATION FOR THE VERTEX MODELS 
A. Vertex Model on Orientable Graphs 

To introduce a representation that leads to the loop expansion it is convenient to introduce simple integral calculus 
and discrete Fourier transform for functions f(cr) of an Ising (spin) variable. Note that "integrals" here are nothing 
but sums over discrete sets, introduced solely to simplify notations. A Fourier transform of f(cr) is a function j-f(jt), 
where the corresponding momentum n = ±1 is also an Ising variable. The definitions and properties of integrals and 
Fourier transform are as follows 

f(a) = a + ba; f daf{a)= ^ /(ct); j da = 2; f ada = 0; (13) 

J CT=±1 ^ J 

ff(ir) = ~ J da(l + na)f(a); f- 1 g{a) = J d<w{l + ™)g(*); 

= 1; P{&) = ^ p-\l)=2; fi~\^=2a, (14) 



Denoting 



F a {<r a ) = f a {<r a ); Fjfc) =£fj{TTj); fj(tr)= f dTTjFj(ir j )Y[(l+nja<raj), (15) 



a9j 

where, diXj = J\ a3 j ^jm we can represent the partition function in the form 



Z ov = J Ylda a d7v a MJ/vTrij [l[F a (cT a yJ \J[(1 



(16) 



Here and below in this Subsection the index order in the definition of the discrete fields is arbitrary, i.e. <ii a = o a i. 
Our derivation of the loop expansion rests on an important, yet very simple relation that can be easily verified directly: 

cosh(r7 + v)(l + 7T(t) , o 

. ' . — — — r = 1 + tanh (77 + x) - <r) (tanhfr? + x) - tt cosh 2 (?y + x • (17) 

(cosh 77 + crsmh?7) (cosh x + Trsmhx) 

Introducing two sets of parameters rj a j and Xjq, that reside in the graph edges we can make use of Eq. ()17|) to re-group 
the terms. This results in the following expression for the partition function 

Zov = Z gv j dcrdn Y[ Pj fa ) JJ P a (o"a) JJ Vj ), (18) 

j a ja 

ZffU = ^11 ° OS]l + Xja)j , (19) 

Pjfa) = Fjfa) Yl (cosh(xja) + ttj-c, sinh(xia)) , (20) 

P a (fa) = ^("j) II (cosh(r? aJ -) + <J a j smh(i] aj )) , (21) 

Vj a (a aj ,TT ja ) = 1 + (tanh(ryo,j + Xja) - cr aj ) (tanh^o,- + Xja ) - n ja ) cosh 2 (r) aj + Xja)- (22) 
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The desired decomposition is obtained by expanding the V^-terms followed by a local computation. The parameters rj 
and x are chosen using the criterion that skeletons (subgraphs) with loose ends do not contribute to the decomposition. 
This can be achieved if the parameters satisfy the following system of equations 



dirj (tanh(77 QJ - + Xja) - n ja ) Pj fa) = 0, (23) 
J der a (tanh(r/ Qj + Xja) - a<xj) P a {<r<x) = 0- (24) 
The first equation in the system, Eq. (f2"3"|) , can actually be reduced by making use of Eqs. p3M15[) and ([2H|) . to 



dtTj (tanh(f? aj - + Xja) - Vja)Pj(&j) = 0, (25) 
Pj(<7j) = fA a o) II ( cosh (Xj") + °ja sinh(xj Q )) . (26) 

a3j 

Combining Eqs. (|24I25[) we derive 

exp[(?7 aj +Xja)c r ja] = & (o«) = b ( v) Qf) 

cosh [ri aj + Xja] 3 \ " ' 

Aov) _ PjjcTj) ,(ov) _ Paj^a) ,„„.. 

where it is assumed that Oj a = a a j. Eq. ([2"T]) constitutes the BP system of equations, represented in terms of 
parameters r\ and X- Eqs. ([28]) provide the BP expressions for the probabilities (beliefs) to observe the spin vector 
associated with a bit/check in the corresponding states states. 

A typical sum/integral, needed to calculate individual marked path/diagram C contribution, is reduced to the 
following irreducible correlation functions that should be computed within BP 

fi a = I ' da a b^ v \a a ) Y[ {a la -m la ), (29) 
^ = J d(Tib\ 9v} (o-i) Yi (via-m la ), (30) 

where mi a is the BP magnetization at the edge ia 

m ia = J dcr l b i f v) (pi)a la = J da a b{° v) {(T a )o ia . (31) 

B. General Vertex Model 

We are now in a position to consider the case of a general, not necessarily orientable, graph. The loop expansion 
and the BP equations can be readily extended to this case. To derive the desired loop decomposition we relax the 
condition a a f, = aba, i-e. we treat a a b and &ba as independent Ising variables. In complete analogy with the orientable 
case we represent the partition function in a form: 

Z gv = [ daj[f a (a a )l[l±^. (32) 

J a be 

Note that for this representations the vectors cr a become independent variables. Also in the product over be we assume 
that each edge contributes only once. We further introduce a parameter vector 77 with the components r\ a b, all of 
them being independent variables. Making use of Eq. (|17|) we arrive at the following representation for the partition 
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function that is ready for the loop decomposition 

Zgv = Z gv / dcr JJP Q (er Q ) V bc (<r bc ,a cb ) ; Z gv = I JJ 2 cosh (r] bc + r) cb ) J ; 

"* a bc \ he / 

Pa{<Ta) = fa(cr a ) (cosh r] ab + a ab sinhr) ab ) \ (33) 



b£a 

2/ 



V bc {a bc ,a cb ) = 1 + (tanh(77 hc + T] cb ) - a bc ) (tanh(?7 bc + r) cb ) - a cb ) cosh (r] bc + i] cb ). (34) 
The BP equations for our general case have a form: 

J der a (tanh(j7 a6 + rj ba ) - a ab ) P a {cr a ) = 0. (35) 

To recast Eq. Q35|) in a stndard BP form we denote by r\ ab the vector with the components rj ac with c S a and c =/= b, 
i.e. r; a = (r) a b,Vab)- We also define a function j(r) ab ) using the condition 

/c^b c^b 
Y\_da ac f a (cr a ) JJ(cosh77 ac + cr ac sinh?7 QC ) = 0(cosh7 + a ab sinh7). (36) 



c£a cGa 



The meaning of Eq. (f3"6"]l is as follows. The l.h.s. of the equation is a function of the Ising variable a ab and a function 
of ?7ab (since by definition it does not depend on rj ab ). The r.h.s. constitutes a generic representation of such a function 
provided <f> and 7 are allowed to depend on 7] ab . Integrating (summing) over a ab in Eq. (|36p with and without the a ab 
factor allows to determine the function j(r) ba ) explicitly: 



tanh 7 (r/ afc ) = — ^ — — - ^ e#j) ; ; ■ — — . (37) 



/ d(T a (J b af a ((Ta) J] eg a (cosh 7? ae + g ae Smhrfac ) 

J dcr a f a (cr a ) J]cea (cosh77 ac + cr ac sinhry ac ) 
Multiplying Eq. (|3l)|) with a factor (cosh^,-,./, + er a b sinh?7 ;,) yields 

/ JJ da ca P a (cr a )(f) (cosh(7 + ?7 q6 ) + cr afc sinh(7 + r} ab )) . (38) 

Comparing Eq. (j3"5)) with Eq. (|35p we arrive at sinh(7 — r/ ba ) = 0. This allows to represent the BP equations in a 
more conventional form 

Vba = 7 {Vab) ■ (39) 

Calculated within BP, the probability of finding the whole family of edges connected to node a in the state <x a is 

J d(T a P a ((T a ) 

In the general vertex model case a typical integral (sum), needed to take to calculate a diagram contribution for a 
generalized loop C, is reduced to the corresponding irreducible correlation functions of the spin variables computed 
within BP: 

fi a = f da a b{ 3V \a a ) J] (a ab -m ab ), (41) 

J bea.C 

where m ab is the magnetization at the edge (ab) calculated within BP 

m ab = J d<T a b a 9V \<r a )a ab . (42) 

The final and most general expression Eq. © emerges in the result of direct calculation of the (generalized) loop 
contributions making use of Eqs. (|33I39I41I42[) . 
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III. LOOP SERIES DERIVATION FOR THE FACTOR-GRAPH MODEL 



A. Integral representation for the Factor-graph model 

We aim to derive a convenient integral representation for the statistical model ([TJ. As a first step we introduce 
two statistically independent sets of discrete random variables: the original {<7i}, and the additional factor- variable 
counterpart {7r Q }, where each Tr a is a vector consisting of q a scalar components, each a discrete random variable, 
and q a is the degree of connectivity of the corresponding factor node. For the example represented by Fig. |1| we 



have 7r a = {■n y a l , n^'), n b = (ir^ , 7r£ 2) ), ir c = (wc 
representation the partition function of Eq. fl} can be rewritten as 



J 3 ) 



(4)\ 



fa 



a,b,c,d 



H/afaa) 



n 



a3i 



= =bl. Using such a 



(43) 



where the product over i is taken over the bits connected to more then one factor nodes. Under condition that all 



discrete scalars 7r« belong to the binary alphabet the expression on the rhs of Eq. (|4"3"j) can be rewritten as 



E 



exp 



- 1 



a9z 



1-g* 



(44) 



where > 1 is the degree of connectivity of the bit i, and Xi is a vector with the components Xiat where a 3 i. 
Integration goes over a gj-dimensional cycle Cj = Y\ a3i Ci a that constitutes a cartesian product of contours in the 
complex plane: C ja , connects the points Zj a and Zj a + 2iri (qj — 1) in an arbitrary way however such that the contour 
does not go through the point of formal singularity of the integrand in Eq. (|44p . It is straightforward to check that the 
result does not depend on the particular choice of the reference points zi a . The multidimensional integration contour 
can be defined in way (in the sense of passing the multidimensional pole manifold) that the integral representation is 
exact, yet its deformation that reaches the saddle point does not involve the pole manifold. This is confirmed indirectly 
by identical exact loop expansions that originate from the integral representation and it discrete counterpart. Note 
also that the integral representation is obviously not unique and the specific choice of the representation is dictated by 
our desire to find one that guarantees emergence of the BP in the saddle-point approximation applied to the integral. 
Below in Section fill CI we will verify, indeed, that Eq. (|4"4"| obeys the desired saddle-point property. 
Substituting Eq.([l4")) into Eq. (|4*3")l one derives 



nn^ 



n 



exp 



\ ct3i / . 



n 



/afa a ) exp 



(45) 




ex P [-Q (x)] 



nn^ 



exp [-So (X)]- (46) 



B. Relation to the Bethe variational approach 

The expression of Eq. (|43)l is compact and already constitutes a good starting point for further, e.g. saddle point, 
analysis. Meantime, for the purpose of establishing a relation to the Bethe free energy approach of Q and for some 
further applications we introduce the following auxiliary integrations 



YlY\d(pi(<7i)d(pi(ai) 



exp 



E E ^fa*) ( ^fa*) ~~ ai E Xta ) 

E E ^"fa") ( V'afaa) + ln/ a fa a ) + ^ ^ Xia j > 



(47) 
(48) 
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in the rhs of Eq. (|45[) . After some obvious manipulations we arrive at 



nip*. 



]Q Y\_dip l (a i )dip i (a l ) 



Y[ H dlfra (fa ) <#a (Tq ) 



exp [—5] , 



5 = E 



V5i(<Ti) ^i(cri) - a; ^2 Xta) + (ft - 1) In ( E cxp 



a3i 



E 



53^"( 7r ") ^(TTg) + ln/ a (7r g ) + ^ 7T^ 



Xi- 



ft - 1 

In E eXP ( _ ^a(7Ta)) 



Evaluating the integral over ipi{ui), ip a (ir a ) within the saddle-point approximation we obtain 

exp (0i)/(ft — 1)\ f f t \ \ 

H<n) = Vt^ ~\ = K-V'exp M^Vo/C* - 1)) , 



exp i 



' -r J w (sp) , , 

Eexp(-^ SP) (7T Q )) V 



■4 SP) (7Ta) 



(49) 



(50) 



(51) 



(52) 



Expressing tpi(<7i), ip a (j^a) m terms of tfi(<Ji), ip a (ir a ) according to Eq. (|51I52|) and substituting the result into the 
effective action (l50l) we find 



5 (sp) = _^^Vi Q (^ Q )ln/ Q (7r Q ) 



EE^E^ I Vi(^) - E ^«(7ra) ] -53 inz « sp ' ) +E(* ~ i ) ln 



>p) 



(53) 



7T a \(Ti 



The saddle-point (in ip and y>) solution (|51l52p is highly degenerate: there is a freedom in imposing a constraint per 
any bit i and per any factor-node a. Moreover, the integrand in Eq. (|49l50p is invariant under the transformations: 



i>a(Ka) -> 1pa(n a ) + C a , lfi(<7i) -» <fi(<Ti) + Ci 



(54) 



Fixing the values of ^ <fii(<Ti) and J^-r ^ai^a): introducing the shifts (fi>4")) into Eq. (|49l50p and integrating with 
respect to Ci,c a one arrives at the normalization constraints 



E <Pi( a i) = 1 ' E ^( 7F «) = 



(55) 



that are dynamically imposed, i.e. they are present in the integrand of Eq. (|49p as products of the corresponding sets of 

((-functions. A convenient choice of ^ <Pi(0i) and ^ ip a (^a) constraints is the one that makes = Za^ = 1. 
As a result the last two terms on the rhs of Eq. (p)3")) disappear and the equivalence between the effective action ([55)1 
and the Bethe free energy of Q (see also Appendix [XJ becomes clear. 



C. Belief-propagation as a saddle-point 

Looking for the saddle-point configurations of the auxiliary fields (pi(<Ji), i/> a (""«)> tpii&i), ^a(^a) an d x%a that 
dominate the contribution to the integral in Eq. (|4"9"]) . and thus setting the corresponding partial derivatives of S to 
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zero, we obtain in addition to Eqs. (|51I52|) the following saddle-point equations 



(56) 
(57) 
(58) 



This system of Eqs. (|51l 



is identical to the BP system of equation derived via variation of the Bethe free 
energy 0] (see also Appendix [A"]) . The relation between the corresponding fields is as follows: (pf^ {oi) «-► &i(<7j), 

E/saTw- Normalization constraints (|A2[) are obviously satisfied 



Aia («7t), Xia 



in Eqs. (|51l52p . The consistency constraint (|A3|) is equivalent to ([5 
Eqs. (|51I52I56M58[) results in 



£ir a V< 6XP (Sjec 'xja ) ex P E/3 3 i X^ - 1 



E,r Q /«W ex P (Ejec T» ^j^) E CT , ex P E 



v (sp) /Co- 



(59) 



that can also be derived directly from Eq. (j4l)|) . Left-hand-side or right-hand-side of Eq. (|59p gives saddle-point, BP 
expression for (pi{(Ji) - the probability to observe spin at the bit i in the state Ui. 

The set of Eqs. (|51I52I56B5|) coincides with the one derived as an extremum condition for the Bethe-Free energy 
(see also Appendix \K§ . Iterative solution of these nonlinear equations reproduces the famous Belief Propagation 
algorithm for efficient yet sub-optimal solution of the inference problem. 

The saddle-point approximation for the partition function 



exp 



-So [X 



/Op) 



(60) 



is expressed in terms of the effective action <So defined in (I45I46[) . Note, that there may be more than one realizable 
(correspondent to real valued x^') solution of the saddle-point (BP) system of equations. 

For the purpose of further applications let us also introduce the magnetization and irreducable correlation function 
(defined at two bits neighboring the same check), both defined within the saddle-point BP approximation 



m = y"V t <^ sp) (o-j), 

for i,j S a: faj = ^(cri 



mi)(aj -mj)^ { a sp) (a a ). 



(61) 
(62) 



D. Gaussian correction to the saddle-point approximation 



To calculate the correction to the zero-order saddle-point approximation we need to expand the effective action Sq 
in Xia around x^ sp ' ) to the second order. According to the definition of the saddle-point the first-order term in the 
expansion is exactly zero. If the second-order expansion is sufficient, i.e. the higher-order terms are much smaller with 
respect to some parameter (the exact origin of the expansion parameter will be verified and discussed later) , we shift 
the multi-dimensional integration contour that enters Eq. (|4"6"|) in the space of complex x-fields to go exactly through 
the saddle-point. The next task is to calculate corrections that originate from the vicinity of the saddle-point. At 
the saddle-point we choose the local orientation of the integration contour in the steepest descent way. Note, that 
the steepest descent at a saddle-point may go along imaginary or real direction. Finally the integral in Eq. (|4"6"]) is 
approximated by a Gaussian integral. The result of the Gaussian integration in Eq. (|46[) . which leads to a correction 
the saddle-point term, becomes 



exp [S - S%] , Si 



1 



In 



det 



A 



A M;j73 (x tSp) ) 



(sph = d 2 £o(x) 



dXiadXj/3 



ASP) 



(63) 



where all the expressions are taken at X^ sp ^- 
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Calculating the matrix of the second-order derivatives directly from Eq. (|45|) one arrives at 

V tea: A M;M ( X (Sp) ) = ^-f [1-^], (64) 

V a,/3 3i, A la .Mx (SP) ) = ( 65 ) 

V i,i e a, * : A M;JQ (x (sp) ) = (66) 

where all matrix elements between the pairs/links {i,a} and {j,(3} sharing neither a common bit nor a common 
factor/check node are zero. In Eq. (|64I65I66[) m, and fj,ij are the magnetizations and irreducible correlation functions, 
respectively, that are calculated within the saddle-point (BP) approximation [see Eqs. (|A12I62|) ] . Direct calculation of 
the Gaussian integrals based on the problem-specific information on the BP saddle-point solutions and the quadratic 
form matrix (|64I66[) . the latter also depending on the bare saddle-point solutions, provides a straightforward alge- 
braic way of computing the partition function, magnetization, and any other spin-relate objects within the Gaussian 
approximation. 

A potential difficulty in the evaluation of Gaussian integrals originates from a relatively complex structure of the 
matrix A. Specifically, the off-diagonal (|66p term proportional to the irreducible pair correlation function induces 
coupling between different blocks related to the corresponding bits. In the following Subsection we analyze the 
Gaussian integrals perturbatively by expanding the off-diagonal term in an infinite series. 

We will show that the only surviving terms in this expansion, after the Gaussian integrations are performed, 
correspond to loops inn the factor graphs. We will demonstrate that the actual expansion parameter is the product 
over the loop of the terms /%/(! — ntf), evaluated at the saddle point. 



1. Loop expansion in the Gaussian case 



We start with introducing a convenient notation for the Gaussian integration 

n 



M>bd = 



I 76P 



/ ( I! d C P1 ] exp -Kn/3 A SJrCp7 



(67) 



\P,7 



where A^d) j s the block (bit)-diagonal part A^ ^ of A defined by Eqs. (164166)) with the off-block-diagonal part of A, 
described by Eq. (f65|) being ignored. 

An important object Pi a ;j0 = (CiaCjp)bd ^ n ^ ne expansion with respect to A^ ^ is will be refered to as a propagator 
following the traditional physics jargon of the Feynmann diagram expansion. It follows from Eq. (|64l65p that the only 
non-zero component of the propagator is represented by 



a 7^ 0, a,J33i: P iQ;l/ 3 



1 - mi 



(68) 



Note that the propagator has an interesting "fermionic-repulsive" feature: for a fixed bit i it is strictly zero for 
coinciding factor/check indices, i.e. Pi a -ia = 0. In addition to the "propagator", the off-block-diagonal term is 
represented by a "vertex" . The vertex term is nonzero only for 



i 7^ 3; i, j S a : V ia -ip = fiij. 



(69) 



It is also convenient to introduce a graphical notations for both the propagator and the vertex, (see the left part of 
Fig. E]). 

The correction to the partition function adopts the following form 



Z g = ( exp 



K A (off) t 

2 l 5«P il n/3;p7 l 5P7 



bd n=0 



2 n n\ 



f A (Off) , 



(70) 



bd 



where the full partition function of the model is approximated as Z « ZqZ q . Each term in the sum on the rhs of 
Eq. (|70[) can be represented by a diagram. For an n-th order term the diagram contains n-vertices. The Gaussian 
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FIG. 2: Gaussian approximation about the BP saddle-point. Diagrams for the propagator and the vertex are shown on the left. 
The right plot illustrates a loop contribution. The example corresponds to a loop in the model shown in the upper left corner 
of Fig. ([T}. A leg of a vertex (dashed line) should pair with a leg of a propagator (solid line). No unpaired legs are allowed. 



integration that corresponds to each term is performed in the following way. We first consider all possible Wick 
decompositions of the product of In £-terms in n pairs. Each pair in the product results in the corresponding 
propagator. The n-th order term on the rhs of Eq. (|70| naturally decomposes into a sum of n(n — 1) terms each equal 
to a product of n-propagators and n-vertexes. The key observation is that only very few of the terms survive due 
to the specific structure of the propagators ([75)) and the vertexes ([73|> . Indeed, only those terms do not vanish that 
consist of the propagators and vertexes, coupled through their legs and forming a loop in the model factor graph. The 
structure is illustrated in Fig. [2] It is common for the Feynman diagrammatic techniques that the natural object is the 
logarithm ln(Z g ) of the partition function, since only connected diagrams, i.e. the ones that cannot be decomposed 
into a product of other diagrams, contribute to the object. This results in 

_ n m« 

W-Xt^zw Pi) 

where loops C are defined as closed directed self-avoiding paths in the model factor graph that pass from a bit to a 
factor/check and then from a factor/check to a bit, etc, in such a way that returns from a check that belongs to the 
path back to the preceding bit are not allowed. An example of a loop is shown in Fig. [2] 



E. Loop calculus via Integral Representation 

The Gaussian fluctuations analysis is justified only if the higher-order (third-, fourth- , etc) corrections to the 
Gaussian approximation are small compared to the major saddle-point and Gaussian contributions, and the expan- 
sion is controlled by some parameter. Jumping ahead we know that the loop expansion exposed by the Gaussian 
approximation is the correct one, in the sense that the connected loops contributions (no branching) provides the 
leading correction with respect to the branching parameter. However, to see how this general loop expansion actually 
works we need to expand the effective action to all orders around the saddle point and classify an infinite number of 
perturbative terms, which seems a nightmare. 

Fortunately, there is a way out of this technical problem that allows to account for all-order corrections simultane- 
ously. The method is based on introducing a set of new variables Q a = Xia — followed by explicit decomposition 
of the integrand in Eq. (|45p as a product of two non-Gaussian (with respect to the fields Q a ) terms, which are diagonal 
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in the bit and factor/check representations, respectively: 



Z 



P 



nn^ 



II^(c;x (sp) )n^(c;x (sp) ), 



n 

Oi3i 



E/a(71"a)exp J2 ^aX^a eX P U^Cio) 



\a3i 



(72) 



(73) 



E^ J 

7T Q \ j£a 



(./) 



,( s p) 



n 



E/a^cOexP E 



(j) Op) 



exp 



1 n a <,ia 



(74) 



Here we introduced the same factor to the numerator of P and denominator of V, respectively, that is local both in 
bit and check representations. The rational behind such a decomposition choice is to ensure that in the of case of 
the Gaussian approximate perturbative analysis of the effective action all correlations within a block associated with 
the same bit are all included in the "propagator" term P, while the inter-bit interaction appear only in the "vertex" 
counterpart V. 

We further introduce a set of convenient notations that will allow to substantially simplify calculations: 



V^/J: (A{n a )B(irp)) v = (A(n a )) Wa (B(ir )) 



E^(7Ta)/a(7Ta)exp ( £ « X 



(i)Abp) 



"» E fa(n a ) exp E To X 



A S P) 
via 



(75) 



(76) 



Using the notation, the "propagator" and "vertex" terms can be recast as follows 
Pi = 



J] [cosh ( ia + m l sinh Q, 



cosh I E(x i ( : p) + C M )/(*-i) 



9,-1 ' 



V a = 



rrL (Trj 1 - 1 - mi)tanhCm 
AM 1 + tanh Cm 




(77) 



(78) 



Of course, the number of terms in the series will grow exponentially with the size, very much like in the original 
formulation of the problem. However, wise classification of the terms followed by selecting (and calculating) a small 
number of relevant terms allows not only to extract the BP approximation, but more importantly the leading-order 
corrections to BP. The leading-order corrections/terms will be associated with shortest loops on the Tanner graph, and 
this transparent geometrical interpretation will be coming through diagrammatic representation of the perturbative 
terms. Note, that the diagrammatic technique we develop here is of a special kind. The major peculiarity of the 
technique is the non-Gaussian form of the P-term in Eq. (|74p . Our approach is technically reminiscent of the 
celebrated Vaks-Larkin-Pikin approach [24| . used to calculate non-perturbative corrections to ferromagnetic ground 
state in magnets. The reference is not precise as it only means to emphasize a vague structural relation of our method 
to the one introduced in the classical paper *24| where the "propagator" P-term was also non-Gaussian. 

We are now in a position to discuss a typical structure of the integrals over £ for individual terms (diagrams). 
We notice that an individual integral over all possible £ variables always decomposes into a product of independent 
integrals, each over the block of variables related to a bit. The simplest integral corresponds to a bit with all edges 
connected to it being uncolored: 



la-.i 



JJ dXiaPi 



a3i 



i 



exp 



■ V Y (SP) 



l 



exp 



Asp 
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where one uses the saddle-point relation 



1 + m, 
1 - m s 



(?«-l)/2 



(80) 



Combining Eqs. (|79I80[) and substituting the result in Eq. (|72[) we obtain for the first term in the series for the 
partition function 



Z 



a 7r a \i£a J 



1-9, ' 



which exactly reproduces the aforementioned saddle-point result. 
For the general p-order term one arrives at 



J a3i 1=1 V 



t arm £ ia! 



rrii tanh ( iol , ) 



exp 


- V Y iSP) 




a3i 



2"- 



(l + mi ) qi - p + (-l) p - 



exp 


V Y {SP) 




_a3i 



2"- 



(1-m,) 9 



(l-m^- 1 + (_l)P(l + mi ) 



p-i 



(81) 



(82) 



2(l-m?)f-i 

The resulting expression for the entire series derived directly from Eqs. (|72I77I78[) and Eq. (f8"2")l is given by Eqs. (|8I9I10[) 



IV. CONCLUSIONS 



We conclude with presenting a brief outline for our ongoing and future research activities on the way of extending 
the loop calculus detailed in this paper. 

Gauge invariance of the vertex models have been discussed above in the context of two specific representations we 
utilized to derive the loop series formula. However, this important notion allows more universal and mathematically 
accurate formulation. For this purpose, it is convenient to introduce the notion of a graphic tensor and corresponding 
graphic trace (convolution). The graphic trace concept generalizes the standard (in statistical mechanics) transfer 
matrix approach to the models defined using arbitrary graphs. This allows to formulate the loop expansion, the BP 
equations, as well as the Bethe free energy, in a gauge-invariant form. The loop expansion becomes nothing more 
than a representation of the partition function as a sum over all possible configurations using a special BP gauge. The 
graphical trace and the unifying gauge-invariant approach will be discussed in details elsewhere (25j . The universal 
formulation allows for a natural and straightforward extension of the loop calculus to more general statistical models 
that operate with non-binary alphabets (e.g. Potts model on a graph). This general problem will also be discussed 
in 

More generally, we anticipate that the loop calculus can be extended to any classical models residing in graphs 
that are formulated in terms of continuous fields of Abelian and non-Abelian origin, e.g. 0(2) and 0(3) models, 
respectively. Moreover, the approach should also work for Quantum models and fields, e.g. quantum Heisenberg 
model on a graph. The latter may be of substantial interest for developing new approaches in quantum information 
theory. 

Loop series offers an exact representation for the partition function, and also correlation functions, that can be used 
for improving approximate algorithms. This should be understood as follows. Many problems in statistical physics, 
information and computer sciences are intractable, in the sense that the number of steps required to accomplish a 
computation (i.e. to calculate an observable) grows exponentially with the system size. Then the issue of an approx- 
imation and related approximate computational algorithm emerges. Development of a sequence of approximations 
with gradually increasing complexity becomes an important task. On the one hand, the higher is the term in the se- 
quence, the better it approximates the full answer. On the other hand, the complexity should be linear or polynomial 
for at least some number of low-order terms. Given that the first term in the loop series is the BP term, which is 
known to constitute already a very efficient approximation/algorithm, one can use the higher-order loop corrections 
as a regular way of the BP-algorithm improvement. 
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The loop series also introduces an explicit BP-measure on the graph: any loop contribution can be expressed in 
terms of local objects, magnetizations and irreducable correlation functions calculated within BP. Therefore, looking 
for individual loop contributions that dominate the correction to the bare BP approximation constitutes a particularly 
attractive and computationally feasible strategy [26[ . As a side remark we note that various options are available for 
calculating the bare BP contribution and estimating the magnetizations and irreducible correlation functions within 
BP. First of all, one may use the original iterative algorithm of Gallager . Linear programming approach of [27| is 
another very attractive possibility at large signal-to-noise ratios in the cases when the iterative BP does not converge. 
Finally, one may develop an iterative relaxation algorithm that is guaranteed to converge to a true minimum of the 
Bethe free energy [28| . Such an algorithm is expected to perform better than the linear programming algorithm at 
finite temperatures. 

We anticipate the proposed scheme to work very well in many cases, especially for the graphs that are locally 
tree-like. The models with long loops emerge naturallyin the context of decoding of LDPC codes [1, HH, Q3 and also 
in if-SAT satisfiability problem in computer science flU HEI. HH] . 

The loop series can also be very useful for theoretical analysis of problems with disorder [29}, e.g. of the random 
graph type [13, [HI]. The goal here is to calculate the loop corrections to various disorder-averaged correlation 
functions. A particularly interesting question is to differentiate contributions that originate from loops of different 
sizes. Depending on the regime one may expect either dominance of some limited number of shortest loops, or a 
distributed effect of many loops. Thus, for the Viana-Bray model [30| . which contains a large number of short loops, 
considered in the high-temperature regime, the latter possibility was reported in the formal 1/N replica expansion 
[l7l ]. The leading correction to the BP expression for the averaged free energy is dominated by a combined effect of 
many long loops. Further analysis of this and other models, especially the ones corresponding to expurgated ensembles 
of random graphs modeling LDPC codes with large girth [32], is required to clarify the statistical role of loops of 
different lengths. 

Finally, we are optimistic about using the loop calculus developed in this paper for further analysis and algorithmic 
exploration of the standard lattice models, i.e regular structures with many short loops. A particularly interesting yet 
challenging direction of research would be using the loop calculus, which naturally differentiates the loops of different 
sizes and shapes, for analysis of the critical point behavior in the lattice models. 

We are thankful to M. Stepanov for many fruitful discussions. The work at LANL was supported by LDRD program, 
and through start-up funds at WSU. 



In this Appendix we reproduce a derivation of the Belief Propagation equation based on the Bethe Free energy 
variational principle, following closely the description of 0], e.g. translating it to our notations. We describe the 
Bethe free energy approach for the factor graph model and general vertex models in the two subsequent Subsections, 
respectively. 



In this approach trial probability distributions, called beliefs, are introduced both for bits and checks hi and b a , 
respectively, where i = 1, • • • , N, a = 1, • • • , M. Each belief depends on the corresponding spin realization. Thus, a 
belief at a bit actually consists of two probabilities, and ), and we use a natural notation &j(<x,-). There are 

2 k beliefs defined at a check, k beeing the number of bits connected to the check, and we introduce vector noatation 
tr = (ai t , • ■ • T<Ji k ) where i±, ■ ■ ■ ,ik € a and cr, = ±1. Beliefs, as corresponding probabilities satisfy the following 
inequality constraints 



APPENDIX A: BETHE FREE ENERGY 



1. Bethe Free energy for the factor graph model 



< bi((Ti),b a (* a ) < 1, 



(Al) 



the normalization constraints 




(A2) 



as well as the consistency (between bits and checks) constraints 




(A3) 
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where <r a \<Ji stands for all possible configurations of aj with j e a, j / i. 

The Bethe Free energy is defined as a difference of the Bethe self-energy and the Bethe entropy, 



F B , 



the 



u B , 



the 



the 



defined as 



U Bethe = ~ ^ ^ b a (<T a ) ln/ Q (<T Q ), 

H B ethe = - } J K (<7g ) hi K (<T a ) + ~ !) / , h{vj) hi bj{<Jj), 



(A4) 

(A5) 
(A6) 



where cr a — (c^ , • • • , cn k ), ii, ■ • ■ ,ik £« and <jj = ±1. The entropy term for a bit enters Eq. (|A4[) with the coefficient 
1 — qi to account for the right counting of the number of configurations for a bit: if all entries for a bit (e.g. into the 
check term) are counted the total counting should give +1 for the bit. 

Note, that the definition of f a according to Eq. (|A5[) is not unique. A convenient choice of the factor function 
describing an LDPC code would be 



f a {cr a ) = exp ^2 hmlqi U f[ff„l 

\iEa / \i£a / 



(A7) 



Optimal configurations of beliefs are the ones that minimize the Bethe Free energy (|A4[) subject to the constraints 
(IA1IA2IA3|) . Introducing the constraints as the Lagrange multiplier term to the effective Lagrangian 

L = F Bethe + 7a I^Mer a )-l) + ( X) 6 ^) ~ 1 ) + J2J2J2 X ' a( - cr ^ & «( <t «) ( A8 ) 

a \iTo / i \ c» / i a3i en \ rr a \at I 



and looking for the extremum with respect to all possible beliefs leads to 

SL 



Sb a ((Ta) 

SL 
Sbi(a-i) 







b a ((T a ) = / Q (er Q )exp 



= 



= exp 



V a3i ) 



(A9) 



(A10) 



Eqs. (|A9|A10j) complemented by the normalization and consistency constraints (|A2|A3|) form a close system of BP 
equations for the \t a variables. In the LDPC case, described by Eq. (|A7[) . BP equations are traditionally written in 
terms of 77-ficlds defined on the edges according to 



_ Aj Q (+) — Aj Q (— ) hi 



(All) 



Substituting Eq. (|A11|) in Eqs. (|A9|A10p one arrives at the following set of equations for the magnetization at a bit 
derived in two different ways 



<Tib a {cr a ) = tanh I rfc, + tanh 1 J^J tanh 77.,' Q 



1 j€a 



^ (Tibi{(Ti) = tanh 



Ea 3i (^»(+)-Afa(-)) 



(A12) 
(A13) 



2(<?i - 1) 

Equating the right hand sides of Eqs. (|A12IA13|) . using Eq. (IA11[) and making some simple manipulations one derives 



0¥><* 

hi + tanh -1 



Y\_ tanh rjjp 



(A14) 



that is the BP system of equations for LDPC codes written in its standard form. (These equations are often described 
in the coding theory literature as stationary point equations for the sum product algorithm.) 
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2. Bethe free energy for the general vertex model 



The variational approximation for the model that generalizes the factor graph case discussed in Appendix IA 11 reads 
as follows. One minimizes the following Bethe free energy 

(b (cr ) \ cea 
ft ~\ )~ y^y^ b ac (a ac ) In b ac (a ac ), (A15) 

with respect to b a (cr a ),b ac (cr ac ) fields under the conditions 

Va,c;c£a: < b a (er a ), b ac {a a , c < 1, (A16) 
Va,c;c£a: }^ b a (er a ) = }^ b ac (a atC ) = 1, (A17) 

ya,c;cea: b ac {a ac ) = ^ b a {cr a ) = ^ b c (<r c ), ( Al 8) 

where as usual we assume a ac = cr ca . The second term on the rhs of Eq. (|A15[) is the entropy term which takes care 
of the "double" counting" of the link contribution: any link enters twice in the entropy part of the first term on the 
rhs of Eq. (tAl5|) . 

Extension of these formulas to the orientable vertex model case is straightforward. It is achieved by partition- 
ing the entire family of vertices {a} into two sub- families {i} and {a}. After that one just needs to replicate 
Eqs. (|A15IA16IA17[) in the bit and check versions respectively, while Eq. (|A19[) adopts the following form 

Vi,a; iea: bi(cr j) = b a (cr a ). (A19) 

Furthermore, considering the case of the orientable vertex model and substituting a particular form of the fi(cri) 
correspondent to Eq. ((4]), we finds that Eq. (|A15[) turns into Eq. (|A4IA5IA6[) under a natural substitution 

6 l(<Tl ) = ( fc(°-0> <*« = *W Va,/3 3 i (A2Q) 
v ' 0, otherwise. v ' 



APPENDIX B: SINGLE LOOP EXAMPLE 



This Appendix serves an illustrative purpose: We show directly how the loop formula (|8|) works for a simple example 
of the factor graph model ([T]) with a single loop (two bits and two checks, see Fig. [3]). For this simple model the 
belief-propagation equations (|A9IA10|) adopt the following form 

/ q (cti, < J 2)d° 1 J 2 d 2 T2 J 2 

for a, (3 = a, 6; f3 ^ a : b a (ai, a 2 ) = , ,„ , ,„ , (Bl) 



/V*/ 2 



for i = 1,2: fcfo) = - " " /2 (B2) 
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where the factor functions /a(ci, 02), fbi&x,^), defined for 01.2 = ±1 are considered to be arbitrary. Eqs. (|B1IB2|) 
complemented by the consistency conditions (|A3[) are reduced to the set of quadratic equations that can be solved 
explicitly yielding 



dla = -/,(-, -)M~, -) ~ fa(~, +)/&(-+) + /«(+, -)fb(+, ") + /«(+, +)fb( + , +) 



4 -)/«(-, +) + /*(+, -)/«(+, +)) (/„(-, -)/ 6 (- +) + /„(+, +)) 

(/.(-, -) - /„(-, +)M~, +) + /a(+, -) - /.(+, + ))* 

X [2(/ a (- -)/ 6 (+, -) + /„(-, + )/ 6 ( + , , 



1/2n 



(B3) 



where the BP expressions for dib,d 2 a and d 2 b can be derived making proper permutations of indices and arguments 
in Eq. (|B3j) . Using these solutions we arrive at the following expressions for the partition function calculated within 
the BP-approach: 



2fi 



n,E CTi c /2 « /2 



1 



= o /«(" -) + /»(",+)/&(-, +) + /a(+, -) + /«(+, + )/*(+, +) 



+ 4(/ 6 (- -)/„(-, +) + / fe (+, -)/„(+, +))(/„(-, +) + /„(+, +)) 



-(/«(-, -)/&(-, -) - /«(", +) + /«(+, -)/*(+, -) - fa(+, +)f b (+, +)) 2 



1/2 N 



(B4) 



Bit magnetizations as well as irreducible correlation functions at the checks are found upon direct substitution of (|B3 
and similar expressions for the other d-variables in terms of the factor-functions into 



i = l,2: mi = a l b l (a i ) 1 

a = a,b: [j, a = J~] (g x - m 1 )(a 2 ~ m 2 )b a (a 1 ,a 2 ). 



(B5) 
(B6) 



Substituting these results, together with Eq. (|B4|>. into the loop expression Eq. (jHJ) for the model partition function 
we obtain 



Z = Z Q [1 



(1 — raf )(1 — m^) 

/«(-, -)/(,(- -) + /a(- +) + /a( + , -)/&(+, -) + /«(+, +)fb(+, +), 



(B7) 



which coincides with the exact expression for the model partition function that can be evaluated directly. 
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